登入 Databricks Community Edition (CE) Portal 後,參考 Getting Started document,建立一個 cluster。
%sql
DROP TABLE IF EXISTS diamonds;
CREATE TABLE diamonds USING CSV OPTIONS (path "/databricks-datasets/Rdatasets/data-001/csv/ggplot2/diamonds.csv", header "true")
Run Cell!
%sql
SELECT color, avg(price) AS price FROM diamonds GROUP BY color ORDER BY COLOR
+
,選擇 Visualization
好的,目前已經驗證有個 Spark 環境可以執行 Notebook。